Semantic image understanding : from pixel to word
نویسنده
چکیده
The aim of semantic image understanding is to reveal the semantic meaning behind the image pixel. We categorize semantic image understanding into two broad categories: pixel-level and image-level semantic image understanding. While pixel-level image understanding aims to obtain the semantic meaning of each pixel, image-level understanding aims to obtain the semantic meaning of the whole image, and both levels involve feature extraction and combination. In this thesis, we study semantic image understanding and have made following novel contributions: We investigated the utility of Multiple Kernel Learning (MKL) for feature combination. We introduced the concept of kernel histogram. We observed that the kernel histograms of different features are usually very different, and argued that traditional MKL’s linear kernel combination strategy is not particularly meaningful. Then we proposed the concept of Relative Kernel Distribution Invariance (RKDI) for kernel combination, and have developed a very simple histogram matching technique to achieve RKDI by transforming different kernel histograms to a canonical histogram. We have also developed two kinds of measure for automatically choosing the canonical histogram. Extensive experiments on various computer vision and machine learning datasets have shown that calibrating the kernels to a canonical histogram before they are linearly combined can always achieve a performance gain over state of the art MKL methods. For the problem of understanding image at the pixel level, we advocate the segment-then-recognize strategy. We have developed a new framework which tries to integrate semantic segmentation with low-level segmentation by introducing a semantic feature feedback mechanism. Experiments on two wellknown datasets have confirmed that our new segmentation method can indeed produce regions that are more object-consistent. Besides this, we have also developed a novel idea trying to integrate semantic segmentation with interactive segmentation. We treat the semantic segmentation module as an unreliable teacher which automatically generates tokens to guide the interactive segmentation module. Some qualitative results have shown the promise of this approach.
منابع مشابه
Modern literary interpretation in understanding the meaning of the verse ‘There is nothing like Him’
Numerous views have been expressed by commentators and writers about the literary aspect and the meaning of the Qurchr('39')anic phrase "There is nothing like Him". The sequence of the words "ka" and "like" in the holy verse, has led to two literary and semantic illusions. The literary illusion is that "ka" seems to be redundant and the semantic illusion is the word ‘like’ indirectly proves the...
متن کاملWord clustering effect on vocabulary learning of EFL learners: A case of semantic versus phonological clustering
The aim of this study is to determine the effect of word clustering method on vocabulary learning of Iranian EFL learners through a case of semantic versus phonological clustering. To this effect, 80 homogeneous students from four intermediate classes at an English institute in Torbat e Heydariyeh participated in this research. They were assigned to four groups according to semantic versus phon...
متن کاملThe evolution of the meaning of the word nurse based on the classical texts of Persian literature
Background and Aim: The semantic evolution of a word over time is inevitable, indicating a social, political, religious or cultural process. Nurse is one of the words that has a significant presence in Persian literature texts and has been used in many different meanings such as slave, servan, maid, devotee, obedient, patient and preserver. The purpose of this study is to show its semantic ev...
متن کاملLatent Semantics Local Distribution for CRF-based Image Semantic Segmentation
Semantic image segmentation is the task of assigning a semantic label to every pixel of an image. This task is posed as a supervised learning problem in which the appearance of areas that correspond to a number of semantic categories are learned from a dataset of manually labelled images. This paper proposes a method that combines a region-based probabilistic graphical model that builds on the ...
متن کاملSemantic processing survey of spoken and written words in adolescents with cerebral palsy: Evidence from PALPA word-picture matching test
Objective: The present study aimed to assess and compare semantic processing of spoken and written words in adolescents with cerebral palsy and healthy adolescents. Method: The present study is quantitative in terms of type and experimental in terms of method. Examination Group consisted 30 adolescents with cerebral palsy aged 10 to 15 years were selected by convenience sampling method. All of ...
متن کامل